CDS
Accession Number | TCMCG075C20805 |
gbkey | CDS |
Protein Id | XP_007026259.1 |
Location | complement(join(25791769..25791862,25791955..25792129,25792256..25792424,25792994..25793212,25793311..25793478,25793562..25793747,25793864..25793948,25794272..25794516,25794630..25794962)) |
Gene | LOC18597274 |
GeneID | 18597274 |
Organism | Theobroma cacao |
Protein
Length | 557aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_007026197.2 |
Definition | PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
COG_category | Z |
Description | U4 U6.U5 tri-snRNP-associated protein |
KEGG_TC | - |
KEGG_Module |
M00354
[VIEW IN KEGG] |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE |
ko00000
[VIEW IN KEGG] ko00001 [VIEW IN KEGG] ko00002 [VIEW IN KEGG] ko03041 [VIEW IN KEGG] |
KEGG_ko |
ko:K12847
[VIEW IN KEGG] |
EC | - |
KEGG_Pathway |
ko03040
[VIEW IN KEGG] map03040 [VIEW IN KEGG] |
GOs |
GO:0005575
[VIEW IN EMBL-EBI] GO:0005622 [VIEW IN EMBL-EBI] GO:0005623 [VIEW IN EMBL-EBI] GO:0005634 [VIEW IN EMBL-EBI] GO:0005654 [VIEW IN EMBL-EBI] GO:0031974 [VIEW IN EMBL-EBI] GO:0031981 [VIEW IN EMBL-EBI] GO:0043226 [VIEW IN EMBL-EBI] GO:0043227 [VIEW IN EMBL-EBI] GO:0043229 [VIEW IN EMBL-EBI] GO:0043231 [VIEW IN EMBL-EBI] GO:0043233 [VIEW IN EMBL-EBI] GO:0044422 [VIEW IN EMBL-EBI] GO:0044424 [VIEW IN EMBL-EBI] GO:0044428 [VIEW IN EMBL-EBI] GO:0044446 [VIEW IN EMBL-EBI] GO:0044464 [VIEW IN EMBL-EBI] GO:0070013 [VIEW IN EMBL-EBI] |
Sequence
CDS: ATGATGAAAACAAAGAGGGATGATGGTAATAGTGAAGACTGGAGAGGGGACTCGAAACGGCGTAAAGTTGTGGATTGGCCATCTTCACCTTCTGAAGAGCCACTTGTGCCTTACAATGATGATGAAGATGATGAAAGGAGGGCATTGGGTCGCGTAGGCGGTGGAGAACAAGATGGTGAAGCAAGGTCAGAAGGGAATGGTCAGGGAGTTAAAAGTGAAGAAGATGAAGATGACAGTGATGATCCATATGGGCAAGGATCATTTCTGGGGAAGCAAAATCGCCAGGTTGAAGTACGAAGAGACTGCCCTTACCTTGATACTGTTAATCGCCAGGTGCTGGATTTTGATTTTGAGAGGTTTTGTTCGGTCTCTTTGTCAAATTTGAATGTGTATGCATGCCTGGTCTGTGGGAAGTATTACCAAGGAAGAGGGAAGAAGTCCCATGCTTATACTCATAGTCTAGAAGCAGGACATCATGTCTACATCAATCTTCGAACAGAGAAGGTGTATTGTCTTCCCGATGGGTATGAAATTAATGACCCATCATTGGATGATATACGCCATGTTCTAAATCCAAGGTTTACCAGAGAACAAGTTGAACAACTTGACAAGAACAAGCAATGGTCTAGAGCACTTGATGGTTCAGATTACCTTCCGGGAATGGTGGGGCTGAATAATATTCAAAAGACTGATTTTGTCAATGTCACAATTCAATCTTTAATGAGAGTTACTCCCTTAAGGAACTTTTTCCTTATCCCTGAAAATTACCAGCACTGTAAATCTCCACTTGTTCATCGATTTGGGGAACTCACACGAAAGATTTGGCATGCTCGAAACTTTAAAGGACAGGTTAGCCCTCATGAGTTTCTACAGGCAGTTATGAAAGCCAGTAAAAAACGGTTTCGGATAGGTGTGCAGTCTGAGCCTGTTGAATTCATGTCATGGCTTCTCAATACACTACATGCAAATCTAAGAACTTCAAAGAAAAGTAGCAGCATCATCCATAAGTGCTTTCAGGGGGAATTGGAGGTTGTAAAAGAGACACAGAACAAAGCTATCACTGAGAAGAAAGAAAGTGGTGAGGAACAAAATGGAGCTCCAAAAATTACAGATGGTGCAATTGAGAAGCATAATGTTGGTGCTGAAACTTACAGAATGTCCTTTTTGATGCTTGGATTGGATTTGCCAGAACCACCTCTTTTCAAAGATGTGATGGAGAAAAATATAATACCTCAGGTTCCTCTGTTCAATATACTGAAGAAGTTTGATGGTGAAACTGTAACAACTACAGTTCGTCCTCCAGCAAGGATGAGATATCGTGTCACCAGATTGCCACAGTACTTGATACTTCACATGGGCCGCTTTACTAGGAATAATTTCTTCAGAGAAAAGAACCCAACATTGGTGAACTTTCCTGTGAAAAACCTGGAGTTGAAGGACTACATTCCTCTGCCAGCACCAACCAAAGAGAATGAAAAGTTGCGCACCAAGTATGATCTGATTGCTAATATTGTTCACGATGGTAAGCCTGACGAGGGGTTCTACAGGGTCTTTGTACAGCGGAAGTCAGAAGAACTATGGTATGAGATGCAAGATCTGCATGTTTCTGAAACCCTTCCTCAGATGGTTGCACTGTCCGAAGCTTATATGCAGATATACGAGCAGCAACAGTAG |
Protein: MMKTKRDDGNSEDWRGDSKRRKVVDWPSSPSEEPLVPYNDDEDDERRALGRVGGGEQDGEARSEGNGQGVKSEEDEDDSDDPYGQGSFLGKQNRQVEVRRDCPYLDTVNRQVLDFDFERFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRHVLNPRFTREQVEQLDKNKQWSRALDGSDYLPGMVGLNNIQKTDFVNVTIQSLMRVTPLRNFFLIPENYQHCKSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGVQSEPVEFMSWLLNTLHANLRTSKKSSSIIHKCFQGELEVVKETQNKAITEKKESGEEQNGAPKITDGAIEKHNVGAETYRMSFLMLGLDLPEPPLFKDVMEKNIIPQVPLFNILKKFDGETVTTTVRPPARMRYRVTRLPQYLILHMGRFTRNNFFREKNPTLVNFPVKNLELKDYIPLPAPTKENEKLRTKYDLIANIVHDGKPDEGFYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYEQQQ |